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Abstract 

The Kohn-Luttinger envelope-function method is generalized to the case of 
heterostructures with atomically sharp heterojunctions based on lattice-matched 
layers of related semiconductors with zinc-blende symmetry. For electron states 
near the T point in (001) heterostructures the single-band effective-mass equa- 
tion is derived, taking into account both the spatial dependence of the effective 
mass and effects associated with the atomically sharp heterojunctions. A small 
parameter is identified, in powers of which it is possible to classify the various 
contributions to this equation. For hole states only the main contributions to the 
effective Hamiltonian, due to the sharpness of the heterojunctions, are taken into 
account. An expression is derived for the parameter governing mixing of states of 
heavy and light holes at the center of the 2D Brillouin zone. 

PACS: 73.20.Dx, 71.25.Cx, 73.40.Kp 

1 Introduction 

The Kohn-Luttinger effective-mass method [| is widely used to describe electron 
states in semiconductors when applied are external fields varying smoothly over scales 
of the lattice constant a. Although the original method, based on the formalism of enve- 
lope functions, is applicable only to homogeneous semiconductors, various modifications 
of it have been used to describe the electron states in semiconductor heterostructures. 
In recent years there has been a revival of discussion on the applicability of the effective- 



mass method to describe electron and hole states in real nanostructures @-[22]. Many 



different modifications of the effective-mass method have been proposed, which apply 



to the case of a spatially varying effective mass m(r). There are two ways of con- 
structing the effective-mass approximation for heterostructures. 1) Derivation of the 
effective Hamiltonian for the envelope functions, defined over all space. By integrating 
the effective-mass equation, which contains this Hamiltonian, near the heteroboundary 
it is possible to obtain boundary conditions on the envelope functions if needed. 2) 
Derivation or, as is done much more often, postulation of phenomenological boundary 
conditions on the envelope functions at the heteroboundary. This approach makes use 
of symmetry arguments, continuity of the probability flux density, etc. But these argu- 
ments, as a rule, are insufficient to uniquely determine the boundary conditions. The 
second approach is applicable in the case of sharp heterojunctions; all the models in 
which such boundary conditions were obtained dealt with mathematically sharp het- 
erojunctions between the left-hand and right-hand materials. It is implicitly assumed 
that the envelope function on the left side (on the right side) of the heteroboundary 
satisfies the same equation as in the bulk case for the left-hand (right-hand) material. 
In this case the very delicate problem of the accuracy excess arises, which, by the way, 
has not been discussed to this date: the boundary conditions should hold with the same 
accuracy as the equations for the envelope functions hold. 

Below we will follow the first approach, in which it is possible to rigorously treat the 
problem of accuracy, see Sec. 3. 

It is well known that two main problems arise along the path of constructing a 
common equation for the envelope functions. The first is the problem of ordering the 
momentum operators in the kinetic-energy operator (due to the non-commutativity of 
the momentum operator and the function m(r)), on the form of which the solutions of 
the effective-mass equations can depend substantially ||18|| . The second problem is that 
the effective potential near a heteroboundary, as a rule, is not a smooth function on 
scales of the order of a. This calls into question even the validity of using differential 
equations in the method of envelope functions. Let us discuss these problems in the 
indicated order. 



1.1 Account of the spatial dependence of the effective mass 

A necessary condition for the applicability of the one-band equations for the envelope 
functions [one equation is understood here, valid near the bottom of the non-degenerate 
conduction band, or a system of equations for the degenerate valence band] used in the 
effective-mass method is "shallowness" of these states: their energy, measured from the 
band edge, should be small in comparison with the inter-band energy. Therefore, bearing 
in mind the one-band version of the effective-mass method, we restrict the discussion 
to heterostructures consisting of related materials, where the band offsets are small in 
comparison with the characteristic band gaps; this means, as a rule, that other band 
parameters of the semiconductors differ only slightly. Let us consider the first problem, 
which arises even for heterostructures whose chemical composition varies smoothly on 
scales of the order of a. As the zeroth-order potential we choose the potential of the 
crystalline lattice, continued to all space, of one of the materials of the structure [this 
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is not a unique choice, see Ref. J/J], and we treat the difference between the potentials 
of the lattices of the remaining semiconductors as a small perturbation. Following the 
approach of Luttinger and Kohn and deriving the many-band k • p system of equations 
(see, e.g., Ref. ||), we can next attempt to solve the problem of the correct order of 
the non-commuting operators in the kinetic-energy operator for the one-band equations. 
But here yet another problem arises. 

Reduction of the many-band system of equations to a one-band effective-mass equa- 
tion is achieved by eliminating the small envelope functions from the many-band k • p 
system in favor of the large ones by means of some procedure. We make a small de- 
parture here and make use of a formal analogy between the relativistic Dirac equation 
and the many-band kp system of equations for the envelope functions |[23|| , which 
is most simply seen in the two-band approximation (the conduction band and the non- 
degenerate valence band). In the relativistic theory there are two approaches to deriving 
an equation for shallow electron states. One of them consists in eliminating the small 
positron component of the wave function by the method of substitution. In this case, 
we obtain either an exact equation for the electron component, which is not an eigen- 
value equation, Ref. [24|, Ch. 20, Sec. 28, or an approximate equation whose Hermitian 



nature must be checked separately p5| . The second approach is a Foldy-Wouthuysen 
transformation, an approximate unitary transformation of the Dirac equation, Ref. [|24| . 
Ch. 20, Sec. 33. 

In our case the first approach is comparatively simple to realize only in the two-band 
approximation, see, e.g., Ref. f26fl . In a treatment of the contribution of distant bands, 
and this is necessary, in particular, for a valid description of the heavy holes, a number of 
problems arises. Thus, the authors of Refs. [[TJj and |2(| were able to take into account 



only a few of the first-order corrections to the "standard" Kohn-Luttinger equation 
with position-independent effective mass [the small parameter here is the ratio of the 
characteristic band offset to the characteristic inter-band energy]. However, treating 
the expression obtained, for example, in Ref. pi for the position-dependent effective 



mass, it can be shown that the effective mass of the edge of the conduction band of 
one of the non-basis semiconductors does not contain inter-band matrix elements of the 
perturbation potential, obtained using Bloch functions of the band edge of the basis 
crystal (see Sec. 4.1 below). It can be easily seen that this is equivalent to the poorly 
justified approach of neglecting the difference between the inter-band matrix elements 
of the momentum operator or, what is equivalent, the difference between the Bloch 
functions for the materials making up the structure. 

Hence it follows that we should give special attention to the problem of taking distant 
bands into account. Efforts at solving it by direct elimination of the small envelope 
functions by the method of substitution, in addition to its laboriousness, lead finally to 
a non-Hermitian equation, whose solution is still in need of a valid interpretation. 

Below we will follow the second approach, i.e., we will apply the unitary trans- 
formation eliminating the small envelope functions [I], Since we are considering 
heterostructures consisting of related materials, the standard effective-mass method will 
play the role of a first approximation. An account of the spatial dependence of the effec- 
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tive mass necessitates treating corrections to the standard theory, where it is necessary 
to take into account all corrections of the same order without the accuracy excess. 

In order to understand what corrections should be taken into account, let us turn 
to the relativistic analogy with the hypothetical Dirac equation containing the inhomo- 
geneous gap 2m(r)c 2 , where c is the speed of light in vacuum (see Appendix A). The 
ordinary one-band effective-mass equation is an analog of the non-relativistic Schrodinger 
equation. It is important, however, that the effective mass in the two-band approxima- 
tion is proportional to the local band gap E g (r) [this is valid if the effective mass is 
formed mainly by k • p interaction], and its relative variation 5m/m ~ 5E g /E g . Since 
the correction to the kinetic energy describing the spatial dependence of the effective 
mass will have a "relativistic" character, the desired equations for the heterostructures 
will be analogous to the Schrodinger equation with all relativistic corrections — both the 
usual ones (the contribution of nonparabolicity of the dispersion law, proportional to 
p 4 , where p is the momentum operator; the contribution of the spin-orbit interaction; 
and the so-called Darwin term, proportional to the second derivative of the potential en- 
ergy) and a new pseudo- relativistic correction describing Sm(r). Of course, the present 
arguments are valid for describing states whose energies, reckoned from the band edge 
of any of the materials making up the structure, are of the order of the band offset. 
The case of a very small band offset, where the offset is small in comparison with the 
energies of the states is quite trivial: depending on the energies of the states under con- 
sideration an account of the spatial dependence of the effective mass can require treating 
terms with higher and higher powers of the momentum operator. We will not consider 
such a situation. In this sense, introducing a term proportional to the fourth power of 
the momentum operator into the effective-mass equation is a necessary condition for 
a consideration of the effective mass spatial dependence. Note that for homogeneous 
semiconductors an effective-mass equation analogous to the Schrodinger equation with 
first relativistic corrections was discussed already in Ref. f27j (Sec. 27). 

A typical shortcoming of previous works dedicated to a generalization of the effective- 
mass method to electron states in heterostructures is that they take account within the 
framework of perturbation theory of only some of the terms of a given order. Thus, 
Refs. P]-p2]] take account of the spatial dependence of the effective mass parameters, 
but neglect corrections oc p 4 . In Refs. p, [7|, [10], [14[], dedicated to deriving the one- 
band equations for the envelope functions directly from a many-band k ■ p system, the 
main error, leading to an invalid result, is an incorrect estimate, according to which the 
contribution of the k • p interaction terms [i.e., the terms hkp nn i /m , where mo is the 
free electron mass and p nn / is the inter-band matrix element of the momentum] is of 
order the contribution of the potential energy terms [by potential energy here we mean 
the difference in the periodic potentials of the semiconductors making up the structure, 
also treated in the perturbation theory context]. In the case of a smooth hetero junction 
the correct procedure for deriving the one-band effective-mass equation near the bottom 
of the conduction band with all the above-indicated contributions taken into account 



was followed in Ref. 28 
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1.2 Account of atomically sharp heterojunctions 

The second problem requiring careful study is the non-smooth nature of actual het- 
erojunctions, where the transition from one material to the other occurs over scales of 
the order of a. In this case, first, Leibler's many-band k • p system ||, where smooth- 
ness of the potential was a necessary condition for validity of the system, is in need of 
refinement, and second, the problem of transforming to r space from the region of k 
space bounded by the first Brillouin zone is more complicated jj]]. It is also necessary 
to analyze the consequences of the unitary transformation eliminating distant bands. It 
is important to estimate the error that enters at each step. An estimate of this error 
either gives us confidence in the absence of an excess of accuracy or it challenges the 
validity of the effective-mass approximation. In the works known to us which treat sharp 
heterojunctions, such an estimate is lacking. References fl4| and ||20| , for example, only 



point to its smallness, and Ref. [[L9) made some approximations whose accuracy were 
not even estimated. 

Thus, we can formulate the following steps in the construction of an effective-mass 
approximation for heterostructures: a) obtaining a many-band k • p system of equations 
for the envelope functions taking proper account of possible sharpness of the hetero- 
junction; b) reducing this system to one-band equations with the help of the unitary 
transformation to k space, transforming to r space and transforming the resulting equa- 
tion to differential form; c) estimating the accuracy of these transformations. Following 
this scheme, step a) is realized in Sec. 2. The equations include contributions associated 
with the non-smoothness of the heteroj unction on scales of order a which are treated 
within the framework of an approach similar to that used in Ref. [27| to describe the 



short-range part of the impurity potential. Section 3 shows that a common differential 
equation over all space for the envelope functions for sharp heterojunctions exists, and 
its accuracy is determined by the procedure of transforming to one-band equations in r 
space. One-band equations are considered in Sec. 4.1 (the conduction band) and in Sec. 
4.2 (the valence band). Section 4.3 is devoted to heterostructures with super-thin layers. 
It is shown that additional contributions to the "standard" effective-mass equation can 
be classified by powers of the small parameter ka m , where A; is a characteristic value of 
the quasi- momentum of the state and a m is of the order of the lattice constant. Section 
5 constructs a hierarchical scheme of effective-mass equations, the nth level of which 
corresponds to taking account of these additional contributions up to (ka m ) n inclusively. 
The zeroth order of the hierarchy (n = 0) corresponds to the "standard" effective-mass 
equation with position-independent parameters. At the first level of the hierarchy each 
heteroboundary gives an additional 5-function contribution to the potential energy. Only 
at the second level of the hierarchy does spatial dependence of the effective mass appear, 
along with corrections associated with weak nonparabolicity of the spectrum and het- 
erointerface terms of the spin-orbit interaction. At higher levels of the hierarchy nonlocal 
contributions arise, and the one-band differential effective-mass equations do not exist. 
Results are compared with the conclusions of other authors. Brief reports on the results 



obtained have been published elsewhere [pSfl-|j3~I]. 
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2 Many-band k • p system of equations for envelope 
functions in the case of a sharp (001) heterojunc- 
tion 

Let us consider a heterojunction formed from related lattice-matched semiconductors 
with zinc-blende structure. The Schrodinger equation without relativistic corrections, 
which will be taken into account below, and in the absence of external potentials has 
the usual form 

^ P 1 Z7(r) J *(r) = e*(r). 



\ 2 ™c 

Here U(r) = U is the crystal potential of the heterostructure. To start with, we will use 
the following model of this potential: 

U=U 1 + G(z)[U 2 -U 1 ] = U 1 + G(z)6U, (1) 

where U\ = U\(r) and U2 = t^2( r ) are periodic (with the same period) potentials, contin- 
ued through the entire structure, of the left-hand and right-hand materials, respectively, 
the z axis is directed perpendicular to the plane of the heterojunction, G(z) is the form 
factor of the heterojunction 

G(z) \ z< _ d = 0, G(z) \ z>d = 1; 

and the width of the transitional region of the heterojunction is 2c? [non-one- dimensiona- 
lity of G(z) will be taken into account below]. 

It is natural to treat the potential G(z)5U as a perturbation. As the basis for 
expanding the wave function we use the complete orthonormal set of Kohn-Luttinger 
functions {w n0 e lk ' r }: 

¥ ( r ) = £ / F n , (k') e^X'o d 3 k', (2) 

n' 

where u n o = u„o( r ) is the periodic Bloch amplitude for the edge e n0 of the nth band of 
the left-hand crystal at the T point of the Brillouin zone (in the non-relativistic limit), 

( P 2 \ 

+ U 1 )u n0 = e n0 u n0 . 



2m 







The sum in Eq. (0) is over all bands, and the integral is over the Brillouin zone here and 
in what follows unless otherwise stated; ^(k) is the envelope function for the n band 
in k space. Following the standard procedure 0, we obtain a system of k • p equations 

(^ + g^„ (k)+ £5p^, (k) + 

+ E / M nn' (k, k') T n , (k') d'k' = tT n (k) J (3) 
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M nn , (k, k') = Y, Cfg (h -k' z + K zj ) 5 (k,| - k|, + K|y) . 



J 



Here p nn / = (n | p | n') and C" n ' = {^n | 5Ue lKjr \ n'\ and the matrix elements of a 
periodic operator f are defined as follows: 

(27T) 



(n | f | n'} = — f u* n0 iu n > d 3 r, 

i 2 J cell 



where Q is the volume of the unit cell; k|| = (k x ,k y ,0), and K Z j and Km.,- are the 
components of the vector Kj of the inverse lattice perpendicular and parallel to the 
plane of the heteroboundary, respectively; Q{k z ) is the Fourier transform of G(z). Let 
us analyze the expression for the matrix elements A4 n n>{k, k'): 

M nn , (k, k') = Y c fG (h -K + K zj ) 5 (k,| - kj,) + 

3 (K||j=0) 

+ E C-'g(^-^ + ^)5(k||-k( | + K| b .). (4) 

3 ( K lli^°) 

The second sum in Eq. (|J) describes transfer processes in the two-dimensional Brillouin 
zone, when the projections of any pair of vectors k and k' from the bulk Brillouin zone 
onto the plane of the heterojunction satisfy the condition kji — k|| = K||j 7^ 0. For a 
hetero junction of arbitrary orientation such transfer processes exist. However, for the 
orientation of interest to us — the (001) orientation — their contribution to the desired 
equations for the envelope functions disappears (see Appendix B): 



M nn , (k,k / ) = <5(k||-k| 



g (k z - k' z ) 8u nn , + ]T cfg (k z -k' z + 



(5) 



where we have introduced the notation: Kj = (Air/a)j; j = ±1, ±2, ...; and 5U nn i = Cq 71 ' 
If G(z) is a sufficiently smooth function, a <C d, and we are interested in states with 
k z 2n/a, where k z is a characteristic value of the quasi-momentum of the state, we 
can neglect the second term inside the brackets in Eq. (||) and as a result obtain the well- 
known set of equations for the envelope functions || |28|. In the case of an atomically 
sharp heterojunction, on the other hand, it is possible to proceed in the spirit of the 
method used in Ref. |27| to describe a short-range impurity potential. We introduce the 
function G'(z) = dG(z)/dz, localized on the heteroboundary, \z\ < d. Then for j ^ 
we have: 

+00 

g (k z -k' z + Kj) = — [ G (z) e-W'-K+Ki)* dz = 

Z7T J 

— 00 

1 1 + r° 

G'{z)e- l{k >- k '* +K i )z dz = 



2wi k z -k' z + Kj 
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-d 



and we can write the sum in Eq. (|5|) in the form of an expansion in powers of (k z — k' z ): 

-D snn >. (7) 



j^O s=0,l,2,... Z7r 

The constants in the expansion (|7]) have the form 

d 

D nn> = E<T'^F I G'(z)e- lK ^dz, 

«~ = w-k, I G ' (z)e ~ iK " H - fa ) • 

The present approach fundamentally allows one to treat even mathematically sharp 
heterojunctions, since the necessary convergence of the coefficients D\ nn i is ensured by 
the property C"™ — > as Kj — > oo [but a physically realizable heterojunction cannot 
be mathematically sharp]. 

Let us consider the corrections associated with sharpness of the heterojunction. Sim- 
ple estimates show that terms proportional to D Qnn r, Di nn >, . . . can give corrections not 
greater in order of magnitude than ak z , (ak z ) 2 , . . . , respectively. Our goal is to ob- 
tain one-band equations with position-dependent effective-mass parameters, which is 
achieved by taking account of corrections of order (Xk z ) 2 to the standard approxima- 
tion. Here we have introduced a characteristic "two-band" length A = h(2mE g )~ 1 ^ 2 . For 
GaAs, for example, A ~ 6A. We will make use of the smallness of the parameter d ■ k z , 
which will allow us to write down the final equation in quite simple form (see below). 
Thus, three quantities having the dimensions of length, a, d and A, in combination with 
k z form three parameters whose smallness is employed in the present method. In our 
view, the situation a < d < A, is the most realistic, being realized in semiconductor 
heterostructures with sharp heteroboundaries. Thus, the parameter Xk z can be taken as 
the main small parameter of the problem and the sum in Eq. (|7]) can be restricted to 
terms with s = and s — 1. 

As a result, the many-band system of k • p equations (||) takes the form 

= (e n0 - e + T n (k) + £ ^^T n , (k) + 

+ E / G ^ - K) SU nn , + ^Aw + ^^D lnn , T n , (k' z , k,|) dk' z . (8) 

n' L 

Here we can distinguish different contributions of the perturbation potential: the con- 
tribution of the "smooth" part is represented by the first term inside the brackets, and 
the contribution of the "sharp" part is represented by the second and third terms. 
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2.1 Account of the 3D character of the form factor 

Let us now consider instead of Eq. (|]) a more realistic form of the heteropotential: 

U = U 1 + g(r lb z)5U, (9) 



where r|| = (x,y, 0). By definition, g(ru,z) \ z< _ d = and g(ru,z) \ z>d = 1, and the func- 
tion g(r\\,z) is periodic in r||. For our case of (001) heterostructure the unit translation 
vectors in the plane of the heteropotential are a x = (1, 1, 0)a/2 a 2 = (—1, 1, 0)a/2. The 
sites of the two- dimensional inverse lattice with basis vectors bi = (l,l,0)27r/a and 
b 2 = (—1, l,0)27r/a are projections of the sites of the three-dimensional inverse lattice 
onto the (001) plane. 

We expand g{r\\, z) in a 2D Fourier series: 



where the summation index I is defined so that the vectors K/ determine the sites of the 
indicated two-dimensional inverse lattice, and 



1>D 
cell 



The integration is over a unit cell of the 2D lattice with area Q\\ = a 2 /2. Denoting 
G(z) = Go(z), we obtain for the perturbation potential 

g (r,,, z) 5U = G (z) 5U + 5UY, Gi («) e iK;r|1 • (10) 

It can be seen that the simple model ([I]) takes into account the first term in the sum 
(TO). The functions Gi(z) for I ^ are nonzero only for z G [— d, d]. Therefore, the 
left-hand side of Eq. (0) will include an additional sum of interface contributions: 



E/^L (k,k')^ (kO d 3 k>, 



where 

ML (k, k') = £ -^-3 f G^^e-^'+^dh. 

l^Q;j l^TTj J space 

For states with \k x \ + \k y \ < n/a (see Appendix B) we obtain 



Ml, (k, k') = 6 (k„ - k|,) £ -j^5 Kl>Kl]j J Gi (z) e -*<*.-*+*«* dz 



d 



rmn' " . 

^(k||-k||) E / dz U d\9{r\^)e-^e-^- k '^ K ^^ 



For a smooth heterojunction it is necessary to retain only terms with K z j = in the 
sum and develop the expression in the standard way [[|, since the equations for the 
envelope functions in the r representation will include an additional interface potential. 
Note that while the smooth part of G(z)8U ensures mixing of states of the same crystal 
symmetry [i.e., the local symmetry of the smooth part of the perturbation potential 
G(z)8U coincides with the symmetry of the bulk crystal], this additional potential also 
ensures mixing of states of different symmetry. We are interested in the case d ■ k z <C 1, 
and this obviates the necessity of separating effects of this additional potential into 
contributions of smooth and sharp parts and allows us to use the expansion 

ML(k,k') = 5(k ll -kj | ) £ ^=^IU (11) 

s=0,l,2,... Z7F 

where 

Dlw = E if- / dz f d 2 r ll9 (r,|, z) e~^\ 

j(K||^0) "II l d 

d ] L = E tr / dz L d2 ^ 9 ( r ii' z ) e ' lKjr H*) ' • • • 

In the expansion (|TTD it is necessary to keep only the first two terms; the terms propor- 
tional to D\ nn i Di nn i can give contributions of order d ■ k z and (d ■ k z ) 2 , respectively. 

We have shown that taking the three-dimensionality of the form factor into account 
(see Eq. (|9|)), causes no special difficulty for analysis, and we now make an important 
observation which will allow us to use the simple model (|l|). The function g(ru,z) has 
lower symmetry than G(z). Specifically, it is invariant under symmetry transformations 
from the point group C2 V - But the complete perturbation potential in both models, ([!]) 
and (|9]), possesses the same symmetry, both its point-group symmetry (C2 V ) and trans- 
lational symmetry in the plane of the heterojunction. Information about C2 V symmetry 
will be preserved, however, only if the contribution of the sharp part of the poten- 
tial G(z)8U in model ([!]) is taken into account. Therefore, using model @ instead of 
model (|I]) does not give anything qualitatively new, and only leads to renormalization of 
some parameters, namely those that are negligibly small for the smooth heterojunction 
in model (|]). The expressions for these parameters are very complicated, so in what 
follows we will stick with model ([[]). 

2.2 Account of relativistic corrections 

Let us now consider relativistic effects. We restrict the discussion to the spin-orbit 
interaction. The remaining relativistic contributions only influence the values of the 
constants that we will obtain. We assume that within the framework of perturbation 
theory the characteristic parameter of the spin-orbit interaction and also the difference 
of this parameter for the left-hand and right-hand crystals are less than or of the same 
order as the characteristic band offset. The expansion of the total wave function, as 
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before, is given by expression (|2]). Omitting intermediate manipulations, we give the 
resulting, quite lengthy k ■ p system of equations which take into account the spin-orbit 
interaction 

h 2 k 2 \ _ , ^ hp nn > • k 



i*) T n (k) + e (k) + 

2m j m 



+ E ^nn' / Q {K ~ K) Fn> fe k||) d*£ + E 



h(n\ [VUi x p] |n') ■ a 
4m 2 , c 2 



T n . (k) + 



+E 



ft(n| [V5C/ x p] \n') a 



ArriQC 2 



J g (k z - k' z ) T n , (A4k„) 



+E 



ft (n| [n5?7 x p] |n') ■ a 



ArriQC 2 



J %{h z - k' z ) Q (k z - k' z ) T n > (k' z , ky) dk' z + 

+ E j n>wuj;u\\nr<> s (k|| _ K)g (fe _ K) Tn , (k , } A , + 



1 n _i_ ^£ ^ 

—UOnn' H ^ 

Z7T Z7T 



+ E / ^ ( k ll - k '|) x *] • Bcw^' (k') d 3 fe' = 0. 

n' 

The vectors S 0n n', Si nn / and B 0rm / have the following form: 

h (n\ [V (e^'SU) x p] |n') '' 



Sonn' — E 



2^2 



G'(z)e-^ z dz 



(12) 



'Inn' 



E 



h (n\ V(e iK i z 5U) x p \n') 



AKjITIqC 2 



G'(z)e-' lK > z zdz 



-d 



E 



ft (ra| e %K ^ z [V5U x p] |n') 



AiK 2 rriQC 2 



G'( z )e- iK > z dz: 



B 



Onn' 



E 



ft (n| V U^'SU) |n'} 



/ G'(z)e- iK > z dz. 



Here n is the unit vector along the z axis, nG'(z) = VG(z), and a are the Pauli matrices. 
On the left-hand side of Eq. flT2"D the fourth term describes the spin-orbit interaction in 
the potential of the basis semiconductor; the fifth, sixth, and seventh terms are due to 
the smooth part of the perturbation potential. The terms proportional to Sonn', Si nn / 
and B nn' are due to the sharpness of the potential. In Ref . , in a consideration of 
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the state of the conduction band in heterostructures with smooth heteroboundaries, we 
neglected the sixth and seventh terms on the left-hand side of Eq. fljjD as small. We 
noted that in second-order perturbation theory they, together with ftk- p nn ,/m , give 
a correction only of order (\k z ) 2 m/mo, which can be neglected as the effective mass is 
small in comparison with itlq. For the hole states, on the other hand, m/rriQ is able not 
to be a small parameter. 

We do not consider k-linear contributions of the spin-orbit interaction due to the 
potential U\. They give corrections of order (Afc z ) 3 (third-order corrections, along with 
two terms of the form hk ■ p nn >/rno) similar to the contribution responsible for remov- 
ing the spin degeneracy in the conduction band of the bulk semiconductor (we neglect 
terms of this order), and for the valence band it is well known that to first order the 
contribution of these terms is small, and to second order, along with hk ■ p nn //m they 
only renormalize the effective-mass parameters. 

As for the contributions from the sharpness of the heterojunction potential to the 
spin-orbit interaction, the terms proportional to S 0nn ', can give corrections of order ak z , 
while the terms Si nn / and B 0nn ' can give corrections of order (ak z ) 2 . 

It is trivial to generalize to the case of many heterojunctions. In this case it is con- 
venient to choose the coordinates of the heteroboundaries so that the distances between 
them be integer multiples of a/2, so that the phase factor of each of the expansions of 
the type (0) is equal to unity. 

3 Problem of transformation of the effective- mass 
equations from quasi-momentum space to coordi- 
nate space 

In the following section we obtain the one-band effective-mass equations for the conduc- 
tion band and the valence band. But first of all, we must discuss a problem arising in 
the method of envelope functions and associated with the boundedness of k space. Let 
us consider the following one-band equation for the envelope functions f(k z ) in k space: 

jH(k z ,k' z )f(k' z )dk' z = ef(k z ) } (13) 

where k z and k' z are bounded by the Brillouin zone. Transforming Eq. ( |T3"D to go over 
to the coordinate representation we obtain, generally speaking, an integral equation. 
The problem consists in the accuracy with which it is possible to obtain a differential 
equation in r space. Let us consider an equation similar to Eq. (|T3"D, in which k z and k' z 
belong to the entire inverse space: 

+oo 

J H(k z ,k' z )g(k' z ) dk' z = eg(k z ). (14) 

— oo 

The Fourier transform of Eq. fli"4]) with the system of equations ( |i~2|) taken into account 
gives a differential equation in the r representation. If the function g(k z ) vanished for 
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k z not in the Brillouin zone, it would also be a solution of Eq. (|T3|), and we would 
solve our problem exactly. In general this is not so. But in order for Eqs. (p~3|) and 
(|T4| ) to be approximately equivalent, it is necessary that g(k z ) be small for k z not in 
the Brillouin zone. In the theory of smooth perturbations this smallness is ensured by 
exponentially decaying envelope functions in the k representation; however, in the case 
of discontinuous perturbations the envelope functions are decreasing functions of k z with 
only a power-law falloff. Thus, if the envelope function possesses one discontinuity, its 
Fourier transform satisfies g(k z ) oc (Sg/g) ■ (A^) -1 for large k z (where the exponential 
contributions associated with the effects of smooth fields have decayed); where (Sg/g) 
is a typical relative discontinuity of the function in r space. If we consider, for exam- 
ple, the standard effective-mass equations |J with discontinuous potentials, then the 
second derivatives of the corresponding envelope functions will be discontinuous with 
characteristic relative discontinuity of the order of unity [again, for states whose ener- 
gies, measured from the band edge of the left-hand or right-hand material, is of the order 
of the band discontinuity] , and the error incurred by using differential equations will be 
of order (k z /K) 3 , where K is the radius of the Brillouin zone along the k z axis. 

In the case of a quantum well of width L it is possible to treat two cases: k z L > 1 and 
k z L <C 1- In the first case the error is of the same order as for a single heteroj unction; in 
the second case it can be of the order (k z L)~ l (k z /K) 3 . This is an upper estimate. For a 
symmetric quantum well in the conduction band, for example, the error depends on the 
sign of the product of the values of the envelope function on the heteroboundaries, and 
for states of the second 2D subband it is overestimated. In the limiting case of a narrow 
quantum well, L < 1/K, the potential can be replaced by a 5-function. Then we obtain 
an envelope function with a discontinuous derivative and error of order (k z /K) 2 . 

Above, in Eq. (^), we tacitly assumed that the Hamiltonian H(k z , k' z ) is defined for 
all k z and k' z , belonging to the Brillouin zone. Since our goal is to obtain the one-band 
equations, we must take one more circumstance into account. In k space near the T 
point there exists a region Ai in which the spectrum of states of the conduction band of 
the bulk material can be written in the form of a series in powers of the quasi-momentum 
(for states of a degenerate band the spectrum is determined by a matrix whose elements 
are such series). The series converges for |k| < 1/(2A), as follows from the two-band 
approximation [again, such an estimate is valid if the effective mass is formed mainly 
by the k ■ p interaction] . This is the region described by a Hamiltonian of the one-band 
equation. There is also a region A 2 , where the interaction of states of isolated bands 
with distant bands principally cannot be described by this series. In our case of sharp 
heteroj unctions the envelope functions in the k representation fall off according to a 
power law; therefore we should also provide a valid description of the region A2, which 
will be done elsewhere in connection with the problem of inter-valley mixing of states 
in heterostructures. Here we only mention that if the ratio of the characteristic band 
offset to the energy gap between the states under study in the region A 1 and the states 
in the region A 2 is a small parameter uCl, then the error incurred by neglecting the 
region A 2 will be less than or of order u(ak z ) 2 . Thus, the effective radius in k space 
determining the accuracy of reducing the integral equation to a differential equation is 
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in fact not determined by the size of the Brillouin zone along the k z axis, but depends on 
which bands are taken to be distant and are "eliminated" by the unitary transformation. 
In our case, this radius is of the order of 1/A. 

Below we will obtain an equation for the conduction band, leaving out details of the 
unitary transformation to k space and at once carrying out the transformation to the r 
representation. Formally, the final equation is a fourth-order differential equation, and 
the envelope function satisfying it in the case of a discontinuous potential, the most 
unfavorable case for accuracy, has a discontinuous second derivative with characteris- 
tic discontinuity of the order of the second derivative itself. It is possible to proceed 
otherwise. Reducing the fourth-order differential equation to a physically equivalent 
second-order equation [ffif| , we obtain a discontinuous envelope function with character- 
istic relative discontinuity of order (k z \) 2 . This means that for a single heteroj unction 
or a wide quantum well the accuracy of the obtained effective-mass equation is limited 
as a result of having to take account of all terms up to order (k z X) 3 (exclusively). In 
the case of a narrow quantum well, on the other hand, even for L ~ A the effective-mass 
equation should include only first-order corrections associated with effects of sharpness 
of the heterojunction, and considering all remaining orders, including those leading to 
spatial dependence of the effective-mass parameters, will yield an excess of accuracy. 
In such a case the short-range potential formalism, already used above to obtain the 
expansion flllD . This will be taken up in Sec. 4.3. 

4 One-band equations 

4.1 Conduction band 
4.1.1 Smooth heterojunction 

The transformation from a many-band system of equations for the envelope functions to 
a single-band equation is realized in the standard way [[!], |27J . For smooth heterojunctions 
the one-band equation for the envelope functions for the c band (conduction band) was 
derived in Ref. [^] . In the r representation it has the form: 

e c0 F c (r) + l -m a ^ (z) pm ft (z) pm ai (z) F c (r) + T (z) AU C F C (r) + 

+ a p 4 F c (r) + p ( P Jp2 + p2 p 2) Fc (r) + rj [p x n] ■ aT' (z) F c (r) = eF c (r) . (15) 

The conduction band offset AU C and the modified form factor of the heterojunction T(z) 
are given by 

r (z) AU C = G (z) 5U CC + J2'^^G 2 (z) , 

so that in all small corrections G(z) can be replaced by T(z). The position-dependent 
effective mass is given by 

m (z) = mi [1 + rn\ (/xi - /x 2 ) T (z)] , 
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and mi is the effective mass of the edge of the conduction band of the left-hand material, 
and for m^-, which is the effective mass of the conduction band of the right-hand material, 
we have 

l/m 2 = 1/mi + fjL 2 — Mi- 
The parameters a% and j3\ are defined as follows: 



I'l 



2 (/i 2 - iii) 
The parameters //i and /x 2 are given by: 



2oti+/3i = -1. 



4<c|pa,|n) (n|pa,|Z) <S£/i 



2 |(c| pa; |n)| 2 £Z7 c , 
mg (e c0 - e n0 ) 2 n j ml (e c0 - e n0 ) (e c0 - e w ) 

^ _ y^/ 2_(c| Pj-Iw) g/gj (/|p a |c) 

Hj i m ( e c0 — Cno) (e c o — Qo) 



(16) 



(17) 



In Eq. ( |15[ ) cto and /?o are the nonparabolicity parameters of the bulk spectrum. Finally, 
the last parameter entering into the equation is 



V 



£' 

n.l 



h 2 (c| p z \n) (n\ [V6U x p] x |Z) (jj p y |c) 
4im^c 2 (e c0 - e n0 ) (e c0 - e zo ) 



In the Introduction it was pointed out that an invalid expression for the effective 
mass of the edge of the conduction band of a non-basis semiconductor was obtained in 
Ref. |14| by direct elimination of the small envelope functions. This corresponds to the 
absence in expression ([16]) of the second sum, and in expression flT7| ) of the terms with 
n ^ I. Thus it is necessary to be careful when using the method of direct elimination of 
small envelope functions. 

The relation of the envelope function of the conduction band F c (r) with the total 
wave function is given by 

(r) = u c0 {1 + 2R mi (r (z) AU C + e c0 - e)} F c (r) + 



UnO 



£ c o — e n o) 



h(n\p\c) .V + 5UncT(z)+ ^(n\j> a \l) (l\p p \c) 



trriQ 



Here 



R 



{c\Vx \n) 



™6 1^0 - QoJ 



9^_^ 2 I \2 ' 

z n m [e C Q — e n o) 



PaP/3 



F c (r) 



(18) 



and the term 2Rm\(T{z) AU C + e c o — e) inside the braces in expression ( |i"8|) comes from 
the term — Rp 2 treated as a perturbation using the standard effective-mass equation. In 
the brackets in expression (jl~8|) we neglected the term 



h (n| [W x x p] |c) • o h(n\ [V5U x p] |c) • a 

r 1 \Z) 



4mgC 2 



4m 2 , c 2 



(19) 
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since the largest contribution to the matrix elements (^) comes from the region of the 
potential near the atomic nuclei in which the spin-orbit interaction operator can be 
written in the form of a product of operators of the electron spin and the orbital angular 
momentum, and the function u c q is spherically symmetric (the orbital momentum is 
zero). 



In Ref . p8| it was shown that for d ■ k z <C 1 Eq. fllq) can be replaced by an equivalent 
equation where the Heaviside step function Q(z — z ) replaces the function T(z), and the 
coordinate Zq assigning the position of the mathematically sharp hetero junction can be 
chosen arbitrarily within the limits —d < zq < d. The method used in Ref. for this 
transformation is not the most convenient. There is a simpler way of transforming to 
a mathematically sharp hetero junction based on the following chain of identities, valid 
for operators acting on smooth functions: 



Z7T 



dk z e 



tf-z 



r (z') e~ ikzZ ' dz' = 6 (z - Zq) + 



+00 



+ 



2tt 



dk z e ik ^ z - zo) f (r {z') -Q{z'- zq)) e ~ ikz(z '- zo) dz' ^e(z-z ) + 



+5 (z - Zq) 



r [z) dz — (d — Zq) 



+ 6'{Z- Zq) 



r (z) (zq — z) dz + 



(d - Zq) 



-d 



Setting Zq = 0, instead of Eq. (|I5D we obtain a more convenient form of the effective-mass 
equation for a smooth heterojunction: 

[e c0 -e + e(z) AU C + AU cPo 6 (z)} F c (r) + X -m^ (z) pm ft (*) prn" 2 (z) F c (r) + 

+ a p 4 F c (r) + Pq ( P y z + p 2 x p 2 y ) F c (r) + rj [p x n] • a5 (z) F c (r) = 0. (20) 

Here 

m (z) = mi [1 + mi {ji x - fi 2 ) O (z)] , 

and the parameter a<i not only depends on the materials of the heterojunction but also 
takes into account its finite width through its dependence on T(z): 



Hx + 2AU c h~ 



«2 



d 2 - J 2T(z)zdz 

-d 



2 (fl 2 - fix) 

The relation 2a 2 + /3 2 = — 1 is preserved, and 

d 



Po 



r (z) dz — d. 



The transformation to a mathematically sharp heterojunction described above is dictated 
only by arguments of convenience, and the explicit form of the function T(z) appears 
only in integral form in the expressions for the two parameters a 2 and po- 
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4.1.2 Sharp hetero junction 



To include corrections associated with the sharpness of the hetero junction in Eq. ( pOj) , it 
is necessary in the standard procedure |], ^7| to take into account the contributions of the 
terms D 0nn /, D lnn i and B 0nn / in the first-order perturbation theory [S 0cc = Si cc = 0, as 
follows from zinc- blende symmetry], and the contributions of the terms D Qnn * and So n n' 
should still be treated in the second-order theory along with the terms hk ■ p nn , /m . 
Utilizing symmetry properties, it is not hard to obtain the following additional term 
Habr to the Hamiltonian of Eq. fll"5| ) : 



H ahr = DqccS 0) + p5' (z) + 7][pxn]-a5 (z) . 



(21) 



Here 



D 



Occ 



-E 



(c | SU cos (Kjz) | c) 



G'{z) sin (Kjz) dz, 



p= E' 



h (c| p z \n) (n | 5U sin (Kjz) \ c) 
iKjrriQ (e c0 - e n0 ) 



G'(z) cos (Kjz) dz+ 



+ E 



(c | 5U cos (Kjz) | c) 



G' 



'cos (Kjz) 



+ z sin (Kjz) I dz, 



h (c| [V (sin (J£jz) <5LQ x p] x |n) (n| p y |c) 
2Kjm%c 2 (e c0 - e n0 ) 



a(c| V z (sin (i^z) |c)' 



G'(z) cos (Kjz) dz. 



Comparing expression (|2l|) with the Hamiltonian of Eq. ( p0|) we see that for the con- 
duction band taking the sharpness of the hetero junction into account does not alter the 
form of the one-band equation and simply renormalizes the constants used in it. If we 
take Eqs. fl20D and (pl|) into account, the desired equation for the conduction band takes 
the form 

(e c0 - e + 9 (z) AU C + d x 8 (z)) F c (r) + l -m a (z) pm" (z) pm a (z) F c (r) + 
+ « oP 4 F c (r) + fa (pjp2 + p2 p j) Fc (r) + d 2 [p x n] • trS (z) F c (r) = 0. (22) 



The term pS'(z) in expression (|2l|) was absorbed by the kinetic-energy operator; as a 
result, for a we obtain 



Hx + 2AU C %~ 



a 



d 2 - f 2T(z)zdz 

-d 

2 (fi 2 - fix) 
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(here the error incurred in Ref. [2J| has been corrected), and 2a + (3 = — 1, with 
d\ = D 0cc + AU c p and d 2 — f] + fj. 

Let us discuss the Hamiltonian of Eq. (0). The first term represents the potential 
energy of an electron in the c band. The possible existence of a heterointerface term 
proportional to d\ was discussed in Ref. ; it is clear that this contribution disappears 
for a mathematically discontinuous heterojunction (in this rather unrealistic case models 
d5) and (H) are identical). The second term is the position-dependent kinetic-energy 
operator, which is quadratic in the momentum; such a form was proposed in Ref. |J 
more generally. Note that the parameter a is not a universal constant but depends both 
on the materials and on the shape of the transitional region of the heterojunction, as 
seems natural even intuitively (see also Ref. ||). If it happens that m(z) = const, i.e., 
A*i = ^2-, indeterminate expressions of the form 1°° arise in the form of the kinetic-energy 
operator used here which are easily evaluated. Such indeterminate forms do not arise 
if we use a different, equivalent form of this operator where we separate out a term 



analogous to the relativistic Darwin term (see Ref. [28 



p 



2m (z 



P 



AC/ C 



2 



T (z) zdz 



+ p\S'(z). 



The third and fourth terms in the Hamiltonian of Eq. (B^) describe corrections to the 
weak nonparabolicity and depend only on the bulk parameters. The fifth term describes 
the interface spin-orbit interaction (see, e.g., Ref. [0), whose strength (d 2 ) depends 
not only on the materials of the heterojunction but also on the shape of the transitional 
region. The relation between the total wave function and the envelope functions of the 
conduction band is given by Eq. ([H3), where we can set T(z) = Q(z). 



4.1.3 Boundary conditions on the envelope function 

From Eq. (^) it is not hard to obtain boundary conditions imposed on the envelope 
function at the heterojunction; to this end it is necessary to reduce Eq. ( |22| ) to a second- 
order differential equation by employing the smallness of the contribution proportional 
to p 4 (see Ref. (EB1). We present only the result: 



where 



and 



F c (r) 



z=+0 



d u 
d 2 i d 22 




dn 



= 1 + 4m?a A[/ c + m%a ( 

\m 2 mi 

, 2m± . , , r , . 
«2i = -T2 - («i + "2 [p x nj ■ a) , 



'22 



m 2 
mi 



Tr 



+ Am\a.Q/S.U c — m^a 



1 

m 2 



1 

m\ 
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In the approximation of the standard effective-mass method we have a position- 
independent effective mass and a discontinuous (step-function) potential. Corrections 
which are first order in the small parameter of the problem are taken into account by 
including in the standard equation a 5-function potential proportional to d% (formally, 
this is a correction of order d-k z ). The complete equation ( p2[ ) also includes all corrections 
of order (Xk z ) 2 . It is not possible to take into account smaller contributions in the one- 
band version of the method of envelope functions because the unavoidable error arising 
in the transformation from the many-band system of integral equations to the one-band 
differential equation is of the same order. 

It is not hard to generalize the above results to the case of an arbitrary heterostruc- 
ture. It is easy to do this proceeding from symmetry arguments for a symmetric quan- 
tum well with two equivalent hetero junctions with coordinates z = and z = L, where 
L = Ma/2, and M is an integer. The effective Hamiltonian is 

H el = e c0 + AU C {Q (z) —Q(z — L)} + \m a (z) pm? (z) pm a (z) + a p A + 
+Po (pjjp' + P*P?) + ^{5 (z) +5(z-L)} + d 2 {5 (z) -6{z-L)}[pxn]-a. 
4.2 Valence band 

The distinction between the effective-mass method for the valence band and the effective- 
mass method for the conduction band consists, in principle, simply of the necessity of 
considering more elaborate equations in the case of the valence band. The main points 
of the problem of deriving the equation for the hole states with position-dependent 
effective- mass parameters were noted in Ref. |^|. The equation for the c band already 
contains three new parameters which depend on the bulk properties of the materials of 
the heterostructure and on the properties of the heteroboundary. For the valence band, 
there is a larger number of such parameters, which may be seen as rendering such an 
equation of little practical value. Therefore we limit ourselves here to a derivation of 
first-order corrections to the standard equation in the above small parameter. 

In the basis {\J, j z )} of eigenfunctions of the total angular momentum J and its 
projection j z , with these eigenfunctions being combinations of the Bloch functions X, 
y and Z of the top of the valence band (transforming according to the representation 
r 15 ) and the spin, the matrix of the effective Hamiltonian for the valence band H in this 
approximation is a sum of the 6x6 matrix of the standard kinetic energy operator T 
(we neglect the small contribution of k-linear bulk terms from the spin-orbit interaction) 
and the 6x6 matrix of the potential energy operator V: 

H = T + V. 

Of course, in the standard approximation V contains only diagonal discontinuous (step- 
function) potentials. Additional terms appear in the approximation that follows which 
are diagonal and non-diagonal 5-function potentials. 
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It is convenient first to find the elements of the potential energy matrix V in the 
basis {X, y, Z}, and then compose from them the necessary linear combinations and 
transform to V. 

Vzz = Vxx = e*o + 8U XX G (z) + D oxx 5 (z) « 
« e xo + SU XX Q (z) + {D oxx + poSU xx ) 5 (z) ; 
V xy = (A + 5 AG (z)) a z + D oxy 5 (z) + S z oxy 5 (z) a z « 



I (A + <5A6 (z)) a 2 + D oxy 5 (z) + (s* xy + ^) 



where 



Analogously, 



A 3M(X\[VU l xp} z \y) ^ 5A 3hi(X\\S76Uxp] z \y) ^ 

Arrive 2 Arrive 2 

V xz = l - (A + SAG (z)) a y + S y 0XZ S (z) a y « 

« i (A + 5A& (z)) a y + (s« xz + % *^\ 6 (z) a y . 

The remaining elements of V can be obtained from those shown above by cyclic permu- 
tation of the indices. 

The contribution of the sixth and seventh terms on the left-hand side of Eq. (|12]), 
as can be seen, is not included in V, since it is negligibly small for the following reason: 

(x | v z u x | y) = o 



(see Ref. |27|], Sec. 21). This means that the matrix element 

(x | vju \y) = (x\ v z u 2 1 y) . 

For the Bloch functions of the nth band at the T point of the right-hand crystal in the 
non-relativistic limit (u n o) we have: 

UnO=UnO + }_^ > ( 23 ) 

n i e c0 — e n'0 

and for the corresponding functions of the edge of the valence band of the right-hand 
crystal X and y the relation [X \ V -JJi \ y) = holds. That is, the seventh term 



on the left-hand side of Eq. flT2"| ) gives corrections only of order (Xk z ) 3 , not Xk z . In 
our approximation it is necessary in general to neglect the difference between the Bloch 
functions u n o and u h q. Hence it follows from invariance of the equation under time 
reversal that the contribution from the sixth term on the left-hand side of Eq. is 
negligibly small. 
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It is not difficult now to obtain the elements of V. We choose the phases the same 
as was done in Ref. and write 



|1> = 



|4> = 



3 3' 
2' 2/' 



|2> 



3 3 
2'~2 



1\ 1 1\ 

'"2/' |5)= 2'2/' 



13} 



16} 



3 r 

2' 2/' 

1 1 

2'~2 



(pairs of states of heavy holes, light holes, and states of the split-off band). Thus, the 
desired potential-energy matrix takes the form 



V 



V a y 

Vr 8 l 





where 



Vr a = E rtt + AU Ta Q{z) + xiS(z) 
V r7 = E T7 + AU rr Q(z) + X2S(z) 

v o = — —° ( z ) ■ 



Here we have introduced the notation 



1 



£r 8 = e*o + 3 A, AU Fs 



SU XX + -6 A, 



2 

i 

3 



E Tr = e xo - -A, AU Vr = 5U X x - -6A, 



2 
—< 

3 



Xi = D oxx + PqSUxx + iSo X y + ^PqSA, 



X2 = D xx + pobUxx - 2iSoxy ~ gPo^A. 
The expressions for D xx, D Q xy and S§ X y have the following form: 



(24) 



D 



oxx 



-E 



(X | SU cos {K jZ ) | X) 



G'(z) sin (Kjz) dz, 



Doxy — E 



{X | 6Usm{Kjz) | y) 



G\z) cos (Kjz) dz, 



(25) 



qz 

°oxy 



E 



h(X\ [V(cos(K jZ )5U) x P ]jy) 



sin (Kjz) dz. 



-d 



Thus, within the framework of the k • p method we have shown that in (001) III-V 
heterostructures mixing of heavy (hh) and light (Ih) holes takes place at the center of the 
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2D Brillouin zone (see Ref. |!3| and the references cited therein) which bears no relation 



to the k 2 -linear bulk terms from the spin-orbit interaction. This mixing is governed by 
the parameter Doxy, which was estimated in Ref. |33| on the basis of experimental data 
for GaAs/AlAs heterostructures: Doxy — 500 meV A. 

In Ref. [[^Jl it was concluded that the strength of the mixing of the heavy and 
light holes at the center of the 2D Brillouin zone is greater for sharp heterojunctions 
than for heterojunctions with smoothly varying chemical composition. But this is valid 
only in model ([I]). Generally speaking, one can draw conclusions only about the depen- 
dence of the strength of this mixing on the structure of the transitional region of the 
heterojunctions. 

For a symmetric quantum well with boundaries z = and z = L the elements of the 
potential energy matrix can be easily obtained from symmetry arguments: 

V r8 = E ra + AU Ta (6 (z) - 6 (z - L)) + X i (5 (z)+5(z-L)), 

Vr 7 = E T7 + AU V7 (6 (z) - 6 (z - L)) + X 2 (S (z) + 5 (z — L)) , 

V = ^(5(z)-8(z-L)). 



4.3 Equation for the envelope function for a narrow quantum 
well in the conduction band 

We will devote separate attention to the problem of electron states in narrow quantum 
wells because at present it is widely held that the effective-mass method is inapplicable 
in such case. Here we treat only states of the c band in a III-V (001) heterostructure 
consisting of related semiconductors of isolated-quantum-well-type (or narrow-barrier- 
type) for the case when its width satisfies L < A. Hole states are easily treated in an 
analogous way. Now the contributions from the sharpness of the potential begin to play 
a much greater role than in the case of a wide quantum well. In fact, the estimate 
k 2 z /2m ~ 5U CC is valid only when the width of the quantum well is greater than or of 
the order of the characteristic localization length of the states. For states in a narrow 
quantum well, on the other hand, of course k 2 z /2m ~ 5U cc k z L, and the contribution to 
the energy eigenvalue from the terms associated with sharpness of the potential can be 
estimated to first order as SU cc k z a, which implies that they must be taken into account 
even in the zeroth approximation. 

Thus, the potential of the heterostructure under consideration can be written as 

U = U 1 +P[U 2 -U 1 ] = U 1 +P (z) SU, 

where U\ and U2 are the periodic crystal potentials of the "barrier" (basis) and "well" 
semiconductor, respectively, and P{z) is the form factor of the heterostructure. We 
choose L so that P{z < —L/2) = P(z > L/2) = 0. It is natural to consider P(z) as a 
local function on the scale of variation of the envelope function of the conduction band 
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since Lk z <C 1. The one-band equation has the following form: 



e co - e + + b x S (z) + b 2 5' (z) + b 3 5 (z) [p x n] • a 
2m 



F r (r) 



0. 



(26) 



where m is the position-independent effective mass, and we have the following expressions 
for the three parameters &j (i — 1, 2, 3): 



L/2 

b\ = (c | 5U cos (Kjz) | c) y P(z) cos (i£jz) <iz; 

-L/2 



(27) 



ft (c| p z |n) (n | 5£7sin (Kjz) \ c) 



L/2 



im {e c0 - e n0 



P(z)sm(Kjz) dz— 



-L/2 



L/2 

(c | 5U cos (Kjz) | c) y P(z) cos (i£yz) 2; dz; 



-L/2 



ft (c| [V (sin (.Kj-z) 5?7) x p] |n) (n| p^ |c) 
2mgc 2 (e c0 - e n0 ) 

L/2 

y P(z) sin (iiC 3 -z) <iz, 



ft(c| V,(sin (K jZ ) 5U) \c) 
ArriQC 2 



-L/2 



in which the summation index j includes zero (the terms with j — represent the 
contribution of the smooth part of the potential). The term in the potential energy 
proportional to 61 gives the main contribution, and the two remaining terms are correc- 
tions of order Xk z . For a symmetric structure, P(z) = P(—z), the equation simplifies: 
b 2 = h = 0. 

Strictly speaking, Eq. ( pp is invalid from a mathematical point of view (it does not 
have a rigorous nontrivial solution); therefore, it needs to be put into a different form 
using the smallness of the term proportional to b 2 , and discarding corrections of order 
(Xk z ) 2 . It is necessary to invoke the approximate relation 



M (z) + b 2 5' (z) ^b 1 5[z + 



(28) 



and in the term proportional to 63, 5(z) can be replaced by 5(z + b 2 jb\) for simplicity. 
We obtain the valid equation 



P 



e co - e + + h6 \z + — + b 3 S [z + — [p x n] ■ a 



2m 



F r (r) 



0. 



(29) 



Note that for states of one band (if we are not interested, for example, in inter-band 
transitions) in a heterostructure with one narrow layer the value of b 2 does not play a 
role: in Eq. ( p6|) we can shift the origin z' = z + b 2 jb\. 
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Assume now that we are dealing with a structure containing two narrow layers lying 
near one another, such that the distance between them is of the order of A. In this case, 
an upper estimate on the error arising from the transformation to a one-band differential 
equation is Xk z (this is valid, in particular, in the case when the constant b\ describing 
the potential of the first layer is equal to the constant describing the potential of the 
second layer taken with opposite sign). Then we should remove from consideration those 
terms containing b 2 and 63 in Eq. (p6|) . 

One could probably treat both layers as one local perturbation, thereby decreasing 
the error, and obtain Eq. (26) with one set of parameters of the local perturbation. 
But in the situation for which we obtained an upper estimate of the error, however, one 
could not then guarantee the smallness of the parameter k z b2/bi on which Eq. (p8|) is 
based. If it is not small, then one could not say that Eq. ( |28| ) is mathematically correct, 
which would imply the inapplicability of such an approach. 

Thus, taking the above modification into account, we can also apply the effective- 
mass method to electron states in heterostructures with super-thin layers. In this regard 
the situation can arise in which the potential of a thick layer of some semiconductor plays 
the role of a barrier for the electron states while a thin layer of the same material will 
couple states and vice versa, depending on the sign of the parameter b\. The sign of this 
parameter, as can be seen from Eq. fl27|), can be different from the sign of the parameter 
(c I SU I c), which defines the conduction band offset at the heteroj unction. 



5 Hierarchy of effective-mass equations and discus- 
sion of results 

We have derived a many-band k • p system of integral equations fll2"D which can be used 
to describe electron states in heterostructures with atomically sharp variation of their 
chemical composition. The system contains contributions in the form of converging 
power series in [k z — k' z ) which are due to the sharpness of the heterojunction. For 
example, such terms were discarded in Refs. fL4| , |19[] , and |2(| so that the effects of 



a discontinuous change in the crystal potential of the structure near a heterojunction 
were in fact neglected. It is specifically the presence of these terms that distinguishes 
the many-band system of k ■ p equations derived here from the system obtained by 
Leibler || for heterostructures with smooth heterojunctions. All of the papers known 
to us which use one-band or many-band schemes of the method of envelope functions 
to describe electron states in heterostructures apply the k • p system of Leibler. Very 
often the distinction between the Bloch functions for the component semiconductors of 
the heterostructure is also neglected, which gives, in particular, 5U nn i = for n 7^ n'. 
Taking into account the terms due to sharpness of the heterojunction, the k • p method 
can also be used to describe inter-valley mixing of states in heterostructures, including 
the problem of T-X z mixing of states in (001) heterostructures. 

The main limitation on the accuracy of the method of envelope functions employing 
differential equations is the procedure of transforming from k to r space. The one-band 
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differential equation (of fourth order) with position-dependent effective mass is valid for 
structures with characteristic width of layers much greater than A, where the length A 
was defined in Sec. 2. 

Above we considered in detail how taking account of contributions of the sharpness 
of the interface potential modifies the equation for states of the conduction band in 
(001) heterostructures of related, lattice-matched III-V semiconductors, derived in Ref. 
||28|| . Formally, the resulting equation for a sharp heterojunction differs from that for a 
smooth heterojunction only by renormalization of the parameters entering into it. In 
the case of the valence band, on the other hand, taking account of the sharpness of the 
heterojunction leads to qualitatively new effects (mixing of heavy and light holes for 
k - 0). 

For heterostructures with wide layers it is possible to construct a hierarchy of approx- 
imations of the one-band method of envelope functions according to the parameter Xk z , 
where k z is the characteristic value of the quasi-momentum of the state. For example, 
for an isolated heterojunction we have the following. 

0) Zeroth level of the hierarchy for electrons. In the effective-mass approximation in 
which small corrections in the order parameter have been neglected, we have the usual 
equation with position-independent effective mass and a discontinuous (step-function) 
potential: 

^e c0 + 6 (z) AU C + |^ F c (r) = eF c (r) . (30) 

1) First level of the hierarchy. First-order corrections are taken into account (here, 
in fact, the small parameter d ■ k z plays a role; 2d is the width of the transitional region 
of the heterojunction) by including a 5-function potential in Eq. fl3"0|), which is localized 
at the heteroboundary: 

^e c0 + 6 (z) AU C + d 1 5 (z) + |lj F c (r) = eF c (r) , 

where d\ is given by the complicated expression in Sec. 4.1. 

2) Second level of the hierarchy. Equation (p2|) includes all corrections of order (Xk z ) 2 . 
Smaller contributions, of third order and higher, cannot be taken into account correctly 
in a one-band version of the method. 

For hole states we obtained, see Eq. (|2~4]) , the first-order corrections to the standard 
equation [that is, the first step of the hierarchy of effective-mass equations for the holes] 
and showed that for (001) heterostructures mixing of heavy and light holes at the center 
of the 2D Brillouin zone does indeed take place and that contributions from the sharpness 
of the heterojunction potential determine the strength of this mixing so that it depends 
on the microscopic structure of the heteroboundaries. In Ref. |2D| it is asserted, in 



particular, that such mixing of heavy and light holes is caused by a difference in the Bloch 
functions for the component semiconductors of the heterostructure and it is absent, if 
one neglects such a difference, or, what should be equivalent, if the Bloch functions of all 
the bulk semiconductors comprising the structure are the same set of functions. If this 
is indeed the case, then the contribution of these terms [even without taking symmetry 
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arguments into account] would be only of order (AA^) 2 , as can be seen from Eq. (f23|). 
In fact, however, the difference in the Bloch functions does not play a substantial role. 
To prove this, consider the idealized situation of a (001) homojunction — the problem of 
hole states in a weak but not smooth external potential, say, W(z) = G(z)Wo, where 
Wo is a constant assigning the jump of the potential, small in comparison with the band 
gap. In this case the point symmetry of the structure, C 2v , also admits the existence of 
mixing of heavy and light holes at the center of the 2D Brillouin zone [Q, and for the 
coefficient governing this mixing, D %y, instead of formula we have 



D oxy = 2^ J? — / G ( z ) cos ( K i z ) dz - 



d 



-d 



This is direct proof of our assertion. 

The independent parameters a, d\ and d,2 introduced in the present work and appear- 
ing in Eq. (p2[), and also xii X2 and D QX y entering into the effective potential energy 
operator for the valence-band states, depend not only on the bulk properties of the ma- 
terials of the heterojunction, but also on its microscopic structure. All these parameters 
determine the heterointerface contribution to the potential energy. At the same time, 
it is well known [34], that as a consequence of the possible appearance of an electric 



dipole moment at the interface of two materials the magnitude of the potential jump 
at the interface can also depend on the microscopic structure of the boundary. This is 
not described in our model of a heterojunction because we do not take into account the 
effect of such a dipole. Including the corresponding discontinuous electrostatic potential 
in Eq. (HD and developing it according to the scheme laid out above also yields the 
desired effect. 

The electron states in heterostructures consisting of thin layers whose thickness is 
less than or of the order of A can be treated only in the approximation quadratic in 
the momentum operator. In this case an account of terms due to the sharpness of the 
potential becomes necessary already in the zeroth approximation. This is clear from Eqs. 
(|27|). In this regard, the following situation is possible: as the width of the quantum 
well is decreased, the bound state can disappear or conversely, a thin layer — nominally 
a "barrier" layer — of some semiconductor can create an attractive potential and form a 
bound state. It is possible that just such a situation was observed in Ref. |35j and then 
modeled in Ref. fl36|. 
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Appendix A: The Dirac equation with position-vari- 
ant gap 

Let us consider a model Dirac equation with a position-dependent gap 2m(r)c 2 



m (r) c ca ■ p 
c<x ■ p — m (r) c 2 



where tp e and tp p are the electron and positron components of the wave function, respec- 
tively. Let m(r) vary weakly in space, i.e., m(r) = rh + 5m(r), so that 5m(r)/m <C 1. 
With the help of a Foldy-Wouthuysen unitary transformation it is quite simple, for 
example, following the scheme laid out in Ch. 20, Sec. 33 of Ref. [24|, to obtain a 
one-band equation describing the states of the electron. Thus, the equation in which all 
small terms have been discarded is the ordinary Schrodinger equation 



m{r)c2+ k 



where tp e is the transformed electron wave function. The equation, on the other hand, 
in which all terms of higher order than 5m(r)/m have been neglected has the form 



H<f e 



H 



m (r) c 2 + ~—~ 

2m 



pdm (r) p /i 2 V 2 5m (r) 



% [V5m (r) x p] 



(Al) 

2m 2 8m 2 8m 3 c 2 4m 2 

All terms in Eq. (Al) with the exception of the third which describes the position- 
dependent mass can be taken as "ordinary." The second, third, and fourth terms (the 
fourth is the Darwin term) can either be written in the following form: 



1 1 h 2 V 2 5m (r) 

P + - 



T 2 = -P / \ 

2 m (r) 



im 2 



or combined into one, quadratic in the momentum, kinetic energy operator: 



To = - 



1 1 



1 



1 



2 a 



m ir 



m ir 



^m(r) 



or some other equivalent form can be used. For example, Ref. |2T1 uses the following 
form for T 2 : 



T 2 = i 
4 



1 



1 



P 



m (r 



+ 



1 



m r 



=P 



m (r 



\f™ (r) 



P 



Thus, in the model Dirac equation with position-dependent gap the concept of a position- 
dependent effective mass shows up only beyond the frames of the non-relativistic (qua- 
dratic) approximation. 
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Figure 1: Projection of the bulk Brillouin zone (the region bounded by the octagon) and 
sites of the inverse lattice (denoted by asterisks) onto the (001) plane. The square with 
diagonal 27r/a bounds the region where there are no 2D transfer processes. 

Appendix B: Regarding transfer processes in the two- 
dimensional Brillouin zone for a (001) heterostructure 

Let us consider the second sum in Eq. ([|) describing transfer processes in the two- 
dimensional Brillouin zone and prove that it does not contribute in the case of interest 
to us of states near the Brillouin zone center in (001) heterostructures. Since the function 
G(q) is nonzero for any q, there also exist nonzero vectors of the inverse lattice K,, for 
which k|| — k|| = Kyj. Here kj, and k|| are components of vectors belonging to the 
Brillouin zone. There exits a finite number of vectors Kj possessing this property. 
Therefore, in general we should also retain the second sum in expression (Q). Let us now 
consider the interesting case of a (001) heterostructure. The octagon in Fig. [I] represents 
the projection of the bulk Brillouin zone onto the (001) plane, and projections of sites 
of the lattice are denoted by asterisks. The function ,F^(&^,kj|) on which the operator 
M. nn i (k, k') acts in Eq. ([|) is defined only for kfi belonging to the projection of the bulk 
Brillouin zone onto the (001) plane. But since all sites K||j for j ^ lie outside this 
projection, there exists a region of k|| for which kn — k|| < K^-, j ^ 0. This region is 
defined by the inequality \k x \ + \k y \ < n/a and is indicated in the figure by the square 
with diagonal 27r/a. The area of this region is one-fourth that of the area of the first 
2D Brillouin zone (the square with diagonal 4n/a). In the region \k x \ + \k y \ < n/a 
the second sum in expression (f|) does not contribute to the equations for the envelope 
functions. Its larger dimensions ensure satisfaction of the conditions of applicability of 
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the derived equations for the envelope functions describing states near the T point. 
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